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ADAPTIVE CONTROL BASED ON 
RETROSPECTIVE COST OPTIMIZATION 

CROSS-REFERENCE TO RELATED 

APPLICATIONS 5 

This application claims the benefit of U.S. Provisional 
Application No. 61/201,035 filed on Dec. 5, 2008. The entire 
disclosure of the above application is incorporated herein by 
reference. 10 

This application is also related to U.S. Pat. No. 6,208,739, 
issued on Mar. 27, 2001, which is incorporated herein by 
reference. 

GOVERNMENT SUPPORT 15 

This invention was made with government support under 
Grant No. NNX08AB92A awarded by NASA. The govern- 
ment has certain rights in the invention. 


The present disclosure relates to methods and control sys- 
tems for using a digital adaptive control algorithm and, more 
particularly, relates to methods and control systems for using 25 
a digital adaptive control algorithm that are based on a retro- 
spective correction feedback filter. 

BACKGROUND AND SUMMARY 

30 

This section provides background information related to 
the present disclosure which is not necessarily prior art. This 
section also provides a general summary of the disclosure, 
and is not a comprehensive disclosure of its full scope or all of 
its features. 35 

Unlike robust control, which chooses control gains based 
on a prior, fixed level of modeling uncertainty, adaptive con- 
trol algorithms tune the feedback gains in response to the true 
dynamical system (or “plant”), and commands and distur- 
bances (collectively “exogenous signals”). Generally speak- 40 
ing, adaptive controllers require less prior modeling informa- 
tion than robust controllers, and thus can be viewed as highly 
parameter-robust control laws. The price paid for the ability 
of adaptive control laws to operate with limited prior model- 
ing information is the complexity of analyzing and quantify- 45 
ing the stability and performance of the closed-loop system, 
especially in light of the fact that adaptive control laws, even 
for linear plants, are nonlinear. 

Stability and performance analysis of adaptive control laws 
often entails assumptions on the dynamics of the plant. For 50 
example, a widely invoked assumption in adaptive control is 
passivity, which is restrictive and difficult to verify in prac- 
tice. A related assumption is that the plant is minimum phase, 
which may entail the same difficulties. In fact, sampling may 
give rise to nonminimum-phase zeros whether or not the 55 
continuous-time system is minimum phase, which must ulti- 
mately be accounted for by any adaptive control algorithm 
implemented digitally in a sampled-data control system. 
Beyond these assumptions, adaptive control laws are known 
to be sensitive to unmodeled dynamics and sensor noise, 60 
which necessitates robust adaptive control laws. 

In addition to these basic issues, adaptive control laws may 
entail unacceptable transients during adaptation, which may 
be exacerbated by actuator limitations. In fact, adaptive con- 
trol under extremely limited modeling information, such as 65 
uncertainty in the sign of the high-frequency gain, may yield 
a transient response that exceeds the practical limits of the 
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plant. Therefore, the type and quality of the available model- 
ing information as well as the speed of adaptation must be 
considered in the analysis and implementation of adaptive 
control laws. 

Adaptive control laws have been developed in both con- 
tinuous-time and discrete-time settings. In the present appli- 
cation we consider discrete-time adaptive control laws since 
these control laws can be implemented directly in embedded 
code for sampled-data control systems without requiring an 
intermediate discretization step that may entail loss of stabil- 
ity margins. 

According to some prior art, references on discrete-time 
adaptive control include a discrete-time adaptive control law 
with guaranteed stability developed under a minimum-phase 
assumption. Extensions based on internal model control and 
Lyapunov analysis also invoke this assumption. To circum- 
vent the minimum-phase assumption, the zero annihilation 
periodic control law uses lifting to move all of the plant zeros 
to the origin. The drawback of lifting, however, is the need for 
open-loop operation during alternating data windows. An 
alternative approach, is to exploit knowledge of the nonmini- 
mum-phase zeros. Knowledge of the nonminimum-phase 
zeros is used to allow matching of a desired closed-loop 
transfer function, recognizing that minimum-phase zeros can 
be canceled but not moved, whereas nonminimum-phase 
zeros can neither be canceled nor moved. Knowledge of a 
diagonal matrix that contains the nonminimum-phase zeros is 
used within a MIMO direct adaptive control algorithm. 
Finally, knowledge of the unstable zeros of a rapidly sampled 
continuous-time SI SO system with a real nonminimum- 
phase zero is used in some instances. 

Motivated by the adaptive control laws given in some 
instances, the goal of the present application is to develop a 
discrete-time adaptive control law that is effective for non- 
minimum-phase systems. In particular, we present an adap- 
tive control algorithm that extends the retrospective cost opti- 
mization approach. This extension is based on a retrospective 
cost that includes control weighting as well as a learning rate, 
which can be used to adjust the rate of controller convergence 
and thus the transient behavior of the closed-loop system. 
Unlike some instances, which use a gradient update, the 
present application uses a Newton-like update for the control- 
ler gains as the closed- form solution to a quadratic optimiza- 
tion problem. No off-line calculations are needed to imple- 
ment the algorithm or control system. A key aspect of this 
extension is the fact that the required modeling information is 
the relative degree, the first nonzero Markov parameter, and 
nonminimum-phase zeros, if any. Except when the plant has 
nonminimum-phase zeros whose absolute value is less than 
the plant’s spectral radius, we show that the required zero 
information can be approximated by a sufficient number of 
Markov parameters from the control inputs to the perfor- 
mance variables. No matching conditions are required on 
either the plant uncertainty or disturbances. 

In some embodiments, a goal of the present application is 
to develop the RCF adaptive control algorithm and demon- 
strate its effectiveness for handling nonminimum-phase 
zeros. To this end we consider a sequence of examples of 
increasing complexity, ranging from SISO, minimum-phase 
plants to MIMO, nonminimum-phase plants, including stable 
and unstable cases. We then revisit these plants under off- 
nominal conditions, that is, with uncertainty in the required 
plant modeling data, unknown latency, sensor noise, and satu- 
ration. These numerical examples provide guidance into 
choosing the design parameters of the adaptive control law in 
terms of the learning rate, data window size, controller order, 
modeling data, and control weightings. 
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According to the principles of the present teachings, a 
discrete-time adaptive control law or algorithm for stabiliza- 
tion, command following, and disturbance rejection that is 
effective for systems that are unstable, MIMO, and/or non- 
minimum phase. The adaptive control algorithm includes 
guidelines concerning the modeling information needed for 
implementation. This information includes the relative 
degree, the first nonzero Markov parameter, and the nonmini- 
mum-phase zeros. Except when the plant has nonminimum- 
phase zeros whose absolute value is less than the plant’s 
spectral radius, the required zero information can be approxi- 
mated by a sufficient number of Markov parameters. No 
additional information about the poles or zeros need be 
known. We present numerical examples to illustrate the algo- 
rithm’ s effectiveness in handling systems with errors in the 
required modeling data, unknown latency, sensor noise, and 
saturation. 

Further areas of applicability will become apparent from 
the description provided herein. The description and specific 
examples in this summary are intended for purposes of illus- 
tration only and are not intended to limit the scope of the 
present disclosure. 

DRAWINGS 

The drawings described herein are for illustrative purposes 
only of selected embodiments and not all possible implemen- 
tations, and are not intended to limit the scope of the present 
disclosure. 

FIG. 1 depicts a closed-loop system including adaptive 
control algorithm with the retrospective correction filter 
(dashed box) for p=l . 

FIG. 2 depicts roots of p 20 (q) for the stable, non-minimum- 
phase plant in Example 5.1. The dashed line denotes p(A)= 
0.95. Note that the roots outside p(A) are close to the outer 
nonminimum-phase zeros -1.5 and 1 .25. The remaining roots 
are either located at the origin or form an approximate ring 
with radius close to p(A). 

FIG. 3 depicts roots of p 25 (q) for the unstable, nonmini- 
mum-phase plant in Example 5.2. The dashed line denotes 
p(A)=l .4. Note that the root of p 25 (q) outside p(A) is close to 
the outer nonminimum-phase zero -1.5. However, the non- 
minimum-phase zero 1 .25 is not approximated by a root of 
p 25 (q). The remaining roots are either located at the origin or 
form an approximate ring with radius close to p(A). 

FIG. 4 depicts roots of p 25 (q) for the unstable, nonmini- 
mum-phase plant in Example 5.3. The dashed line denotes 
p(A)=0.95, where A is the dynamics matrix of a minimal 
realization of G zu . Note that the roots outside p(A) are close to 
the inner and outer nonminimum-phase zeros of G zu . The 
remaining roots are either located at the origin or form an 
approximate ring with radius close to p(A). 

FIG. 5 depicts closed- loop response of the unstable, mini- 
mum-phase, SISO plant in Example 7.1 using the nonmini- 
mum-phase-zero-based construction of B zu . The control is 
turned on at k=0. The controller order is n c =2 with parameters 
p=l and a(k)=10. 

FIG. 6 depicts closed- loop response of the unstable, mini- 
mum-phase, SISO plant in Example 7.2 with a step com- 
mand. The control is turned on at k=200. The controller order 
is n c =10 with parameters p=5, a(k)=5, and r=10 with B zu 
given by (56). 

FIG. 7 depicts closed-loop response of the stable, mini- 
mum-phase, SISO plant in Example 7. 1 with a step command 
and sinusoidal disturbance. The control is turned on at k=200. 
The controller order is n c =20 with parameters p=l, a(k)=50, 
and r=3 with B zu given by (56). 
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FIG. 8 depicts closed-loop disturbance-rejection response 
of the stable, minimum-phase, SISO plant in Example 7.4. 
The control is turned on at k=200. The controller order is 
n c =15 with parameters p=l , a(k)=25, and r=3 with B zu given 
5 by (56). 

FIG. 9 depicts time history of the components of 0(k) for 
the stable, minimum-phase, SISO plant in Example 7.4. The 
control is turned on at k=200. 

FIG. 10 depicts bode magnitude plot of the adaptive con- 
10 trailer in Example 7.4 at k=1000 samples. The adaptive con- 
troller places poles at the disturbance frequencies Q^Jt/10 
rad/sample and Q 2 13 jt/ 50 rad/sample. The controller magni- 
tude IG c (e ;Q )l is plotted for Q up to the Nyquist frequency 
1 5 Q Nyq =K rad/ sample . 

FIG. 11 depicts closed-loop disturbance-rejection 

response of the stable, nonminimum-phase, SISO plant in 
Example 7.5. The control is turned on at k=200. The control- 
ler order is n c l 5 with parameters p=l , a(k)=25, and r=7 with 
20 B zu given by (56). 

FIG. 12 depicts closed-loop disturbance-rejection 

response of the stable, nomninimum-phase, SISO plant in 
Example 7.5. The control is turned on at k=200. The control- 
ler order is n c 15 with parameters p=l, a(k)=2500, and r=l 
25 with B zu given by (56). Compared to FIG. 11 , the initial 
transient is reduced at the expense of convergence speed. 

FIG. 13 depicts closed-loop disturbance-rejection 

response of the unstable, minimum-phase, SISO plant in 
Example 7.6. The control is turned on at k=200. The control- 
30 ler order is n c =15 with parameters p=l, a(k)=25, and r=10 
with B zu given by (56). 

FIG. 14 depicts closed-loop disturbance-rejection 

response of the stable, minimum-phase, two-input, two-out- 
put plant in Example 7.7. The control is turned on at k=200. 
35 The controller order is n c =15 with parameters p=l, a(k)=l, 
and r=10 with B zu given by (56). 

FIG. 15 depicts closed-loop disturbance-rejection 

response of the stable, nomninimum-phase, two -input, two- 
output plant in Example 7.8. The control is turned on at 
40 k=200. The controller order is n c =15 with parameters p=2, 
a(k)= 1 , and r=8 with B zu given by (56). 

FIG. 16 depicts closed-loop disturbance-rejection 

response of the unstable, nonminimum-phase, two-input, 
two-output plant in Example 7.9. The control is turned on at 
45 k=200. The controller order is n c =10 with parameters p=l, 
a(k)=l, and i=10 with B zu given by (56). 

FIG. 17 depicts closed-loop performance comparison of 
the stable, nonminimum -phase, SISO plant in Example 7.5 
with multiplicative error in B. We take n c =10, p=l, and a(k) 
50 =1 000. The multiplicative error r \ , which is used to obtain the 
Markov parameters for B zu given by (56) with r= 10, is varied 
between 0.3 and 5 . The best performance is obtained for r\ =1 , 
which corresponds to the true value of B. 

FIG. 18 depicts closed-loop performance comparison of 
55 the stable, nonminimum -phase, SISO plant in Example 8.3 
with a multiplicative error in the nonminimum-phase zero 2. 
We take n c =10, p=l, and a(k)=25. The nonminimum-phase- 
zero multiplicative error r|, which is used to construct B zu 
given by (52), is varied between 0.75 and 2.5. The best per- 
60 formance is obtained for r|=l .05, which is close to the true 
value of the nonminimum-phase zero. 

FIG. 19 depicts closed-loop response of the unstable, mini- 
mum-phase, SISO plant in Example 7.6 with random white 
noise added to the measurement. The control is turned on at 
65 k=0. The controller order is n c =l 5 with parameters p=l , a(k) 
=25, and r=3 with B zu given by (56). The performance vari- 
able is degraded to the level of the additive sensor noise v(k). 
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FIG. 20 depicts closed-loop disturbance-rejection 
response of the stable, minimum-phase, SISO plant in 
Example 7.4, where both the actuator and sensor are saturated 
at ±2. The control is turned on at k=200. The controller order 
is n c =15 with parameters p=l, a(k)s25, and r=3 with B zu 
given by (56). The saturations degrade steady-state perfor- 
mance. 

FIG. 21 depicts closed-loop step-command- following 
responses of the stable, minimum -phase, SISO plant in 
Example 7 .4 with and without actuator saturation at ±0. 1 . The 
control is turned on at k=200. The controller order is n c =l 5 
with parameters p=l, a(k)=25, and r=3 with B zu given by 
(56). 

FIG. 22 depicts model reference adaptive control problem 
with performance variable z. 

FIG. 23 depicts closed-loop model reference adaptive con- 
trol of Boeing 747 longitudinal dynamics. The controller 
order is n c =10 with parameters p=l, a(k)=40, and r=10 with 
B zu given by (56). The controller is turned on at t=0 sec, and 
the performance variable converges within about 20 sec. 

FIG. 24 depicts closed-loop model reference adaptive con- 
trol of missile longitudinal dynamics. The control effective- 
ness X =\ , and thus the plant and reference model are identical. 
Therefore, the adaptive control input u ac =0. 

FIG. 25 depicts missile longitudinal dynamics with control 
effectiveness X=0.50 and adaptive controller turned off, that 
is, autopilot-only control. 

FIG. 26 depicts closed-loop model reference adaptive con- 
trol of missile longitudinal dynamics with control effective- 
ness E=0.50. The augmented controllers provide better per- 
formance than the autopilot-only simulation. 

FIG. 27 depicts closed-loop model reference adaptive con- 
trol of missile longitudinal dynamics with control effective- 
ness E=0.25. After a transient, the augmented controllers 
stabilize the system, whereas the autopilot-only simulation 
fails. Note that the system is stabilized despite the total con- 
trol input u reaching the actuator saturation level of ±30 deg. 

FIG. 28 depicts closed-loop model reference adaptive con- 
trol of missile longitudinal dynamics with control effective- 
ness X=0.25. The adaptive controller is initialized with the 
converged gains from the 50% control effectiveness case. The 
initial transient is reduced as compared with initializing the 
control gains to zero. In this case, the actuator saturation level 
is never reached. 

Corresponding reference numerals indicate corresponding 
parts throughout the several views of the drawings. 

DETAILED DESCRIPTION 

Example embodiments will now be described more fully 
with reference to the accompanying drawings. Example 
embodiments are provided so that this disclosure will be 
thorough, and will fully convey the scope to those who are 
skilled in the art. Numerous specific details are set forth such 
as examples of specific components, devices, and methods, to 
provide a thorough understanding of embodiments of the 
present disclosure. It will be apparent to those skilled in the art 
that specific details need not be employed, that example 
embodiments may be embodied in many different forms and 
that neither should be construed to limit the scope of the 
disclosure. 

The terminology used herein is for the purpose of describ- 
ing particular example embodiments only and is not intended 
to be limiting. As used herein, the singular forms “a”, “an” 
and “the” may be intended to include the plural forms as well, 
unless the context clearly indicates otherwise. The terms 
“comprises,” “comprising,” “including,” and “having,” are 
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inclusive and therefore specify the presence of stated fea- 
tures, integers, steps, operations, elements, and/or compo- 
nents, but do not preclude the presence or addition of one or 
more other features, integers, steps, operations, elements, 
5 components, and/or groups thereof. The method steps, pro- 
cesses, and operations described herein are not to be con- 
strued as necessarily requiring their performance in the par- 
ticular order discussed or illustrated, unless specifically 
identified as an order of performance. It is also to be under- 
10 stood that additional or alternative steps may be employed. 
Problem Formulation 

We begin by first considering the MIMO discrete-time 
system 


15 

x(k+ 1 ) =Ax(k)+Bu(k) +D { w (k ) , 

(1) 


y(k)= Cx(k) +D 2 w(k) , 

(2) 

20 

z(k)=E&k)+E 0 w(k), 

(3) 


where x(k)6ffi", y(k)eR 7 '. z(k)eR 4 , u(k)eR 4 , w(k)eR 7 ~, and 
ki^O. Our goal is to develop an adaptive output feedback 
controller under which the performance variable z is mini- 
mized in the presence of the exogenous signal w. In (l)-(3), w 
25 can represent either a command signal to be followed, an 
external disturbance to be rejected, or both. For example, if 
D x =0 and E o ^0, then the objective is to have the output E x x 
follow the command signal -E 0 w. On the other hand, if D^O 
and E o =0, then the objective is to reject the disturbance w 
30 from the performance variable E x x. The combined command- 
following and disturbance-rejection problem is addressed 
when D : and E 0 are suitably partitioned matrices. More pre- 
cisely, if D^IT^O], E 0 =[OE 02 L and 

35 


40 then the obj ective is to have E x x follow the command -E 02 w 2 
while rejecting the disturbance D : : Wj . Lastly, ifD x and E 0 are 
zero matrices, then the objective is output stabilization, that 
is, convergence of z to zero. We assume that (A, B) is stabi- 
lizable, and (A, C) and (A, E x ) are detectable, and that mea- 
45 surements of y and z are available for feedback. If the com- 
mand signal is included as a component of y, then the adaptive 
controller has a feedforward architecture. For disturbance- 
rejection problems, the controller does not require measure- 
ments of the external disturbance w. 

50 ARMAX Modeling 

Consider the ARMAX representation of (1), (3), given by 


n n n ( 4 ) 

55 z(k ) = 2 -aizik -0 + 2 piu{k -0 + 2 “ 0, 

/=i /=1 i = o 

where a l3 . . . , a w eM, (3 1; . . . , P w €M 4x/ “, and y 0 , . . . , y„€R 4x \ 
We define the relative degree d^l as the smallest positive 
60 integer i such that the ith Markov parameter 


Hi = EiA^B e IR /zX/ « 

65 

is nonzero. Note that, if d=l, then H 1 =p 1 , whereas, if d^2, 
then p x = . . . =P rf _ 1 =H 1 = . . . =^=0 and H d =$ d . 
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Letting the data window size p be a positive integer, we is the controller gain matrix, and the regressor vector (|)(k) is 
define the extended performance vector and given by 

R* A by 



z(k) 


u(k- 

-1) ‘ 

Z(*) = 


. Ui(k) = 




_z(k-p+ 1). 


u(k - 

-9c). 


where 


5 

(5) 


to 


m= 


y(k - 1) 

y{k-n c ) 
u{k- 1) 

u(k -n c ) 


e IR« c o«+^)_ 


(13) 


q c = n + p- 1. 


We define the extended control vector U(k)eR Pc/ " by 
15 


The data window size p has a small but noticeable effect on 
transient behavior. Now, (4) can be written in the form 

( 6 ) 

where W zvv eR p4xfe4+fe+1) H B ZM 6R p4x?A , and 20 

are given by 


U(k) = 


u(k - 1) 


u(k - p c ) 


(14) 


-<* l h z 

■■■ -OC n h z 0 i z xl z 

... 

xl z 

°/ z x/ z 






■■■ °/< 

xl z 


■■■ 0/ z x/ z -Qfi h z 

... -<*„/, z 

' fix ■■ 

■ Pn \xl u 

0 i z xl u 


0/ z x/ M 






0 i z xl u 


®l z xl u ■ ■ 

■ 0/ zX / M P\ 

Pn 



7o 

®t z xl w 

®t z xl w 


and 


Z(k - 1) 


*«(*) = 


z(k- p-n + 1) 
w(k) 


7n 0 i z xt w 


0t z xt w 7o 


w(k - p - n+ 1) 


0 


l z xl w 


7n 


(7) 


( 8 ) 


(9) 


Note that W zw includes modeling information about the plant 
poles and exogenous input path, whereas B zu includes mod- 
eling information about the plant zeros. Both W zw and B zu 45 
have block-Toeplitz structure. 

Controller Construction 

To formulate an adaptive control algorithm for (l)-(3), we 
use a strictly proper time-series controller of order n c such 50 
that the control u(k) is given by 


where p c =(L. Note that, if p c =<L, then U(k)=U 1 (k). From 
(1 1), it follows that the extended control vector U(k) can be 
written as 


Pc (15) 

U(k) = 2_ J Li6(k - i)(p{k - /), 

/=i 

where 


u(k) = 2 p^Mk - o + 2 &(*)>(* ~ o. 


( 10 ) 


55 


Li = 


h u 

0(pc _ OU x U _ 


; |J \Pclu+lu m 


(16) 


where, for all i=l, . . . , n c , P i (k)eR / * x '‘ and Q.fkjeR'^. The 

controller order n is determined by standard control guide- 1 „ 

lines in terms of stabilization and disturbance rejection. The - Nort, we define the retrospective performance vector 
control (10) can be expressed as 60 z ^y 


u{k) = 0(k)(f>(k), (1 1) Z(0, k) = W zw <p zw (k ) + B zu Ui. (k) - B zu [U(k ) - 0(9, *)], (17) 

where 

9{k) = [Qifkj ... Q„ c (k)P,(k) ... P„ c (k)] e lk : 1 ' where 0€ is the surrogate controller gain matrix, 

B^elf*** is the surrogate input matrix, and 
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-continued 




b(k) = 

2D T (k)R l (k)f(k) + 2[<p{k) <g> R T l2 (k)]f(k ) - 2[R 3 (k) <g> R 4 (k)]vec9(k) 


is the recomputed extended control vector. Substituting (6) 
into (17) yields 

Zm=Z(k)-B z JU(k)=U(Q, k)]. (19) iq 

Note that the expression for Z(0,k) given by (19) does not 
depend on either the exogenous signal w or the matrix W zw , 
which includes information about the open-loop poles as well 
as the transfer function from w to z. Hence, we do not need to 
know this model data, and, when w represents a disturbance, 15 
we do not need to assume that w is known. However, when w 
represents a command, then w can be viewed as an additional 
measurement y, and thus the controller has feedforward 
action. The matrix B zu is discussed further below. 

20 

Note that (19) can be rewritten as 


z(e, k) = f(k ) + D(k)vec9 , 
where 


/(*) i Z(k) -B m U(k) elR^, 

Pc 

D(k) = <P T (k - 0 0 (B zu L)i e U pl z xncl “ (l “ + 


vec is the column- stacking operator, and ^represents the Kro- 
necker product. 

Now, consider the retrospective cost function 


j(e , k) = Z (e, k)Ri (k)Z(e, k) + 2 Z (i 9 , k)R l2 (k)u(e, k + l) + (23) 

u T (e, k + 1 )R 2 (k)u(e, k + l) + tr[R 3 (k)(9 - 9(k)f R 4 (k)($ - 0(*))] 40 


c(k) = fmmfm + tw^w^mk)] 

(28) 

Since M(k) is positive definite, J(0,k) has the strict global 
minimizer 0(k+l) given by 

9(k + 1) = ~^vec~ l [ M~ l (k)b(k )] 

(29) 

Equation (29) is the adaptive control update law. Note that B zu 
(which appears in f(k) and D(k)) must be specified in order to 
implement (29). Furthermore, (29) requires the on-line inver- 
sion of a positive-definite matrix of size n c l z/ (l M +l > ,)xn c ,l M (l M + 
IV 

In the special case 


Hi (k) = lp, z , Ri 2 (*) = 0^ ( „, R 2 (k) = 0 luXlu , 

(30) 

R 3 (k) = a(k)/„ c( i u+ , yh R 4 (k) = /i u , 

(31) 

where a(k)>0 is a scalar, (26)-(28) become 


M(k)~D T (k)D(k)*a(k)I nM , uUy> , 

(32) 

b(k)=2D T (k\f(k)-2a(k)vece(k), 

(33) 

c(k)=f(km+o.(k)tr[e r (k)e(k)] 

(34) 


where R 1 (k)*W 2 * ph , R 12 (k)€ W‘ R 2 (k)e M 4x4 , R 3 (k) 
€ R 4 (k)6 M 4x4 , 


Riik) R n {ky 
Rnik) R 2 (k) _ 


is positive semi definite, R 3 (k) and R 4 (k) are positive definite, 
and 


u(9, k ) = 9(p{k). 


Substituting (20) into (23) yields 


Using the matrix inversion lemma it follows that 

M-\k)=a~\k)I nMlu+ir a^ 

(k)Y l D{k). (35) 

Consequently, in this case, the update law (29) requires the 
on-line inversion of a positive-definite matrix of size pl z xpl z . 
We use the weightings (30), (31) for all of the examples in the 
present application. The weighting parameter a(k) intro- 
duced in (31) is called the learning rate since it affects the 
convergence speed of the adaptive control algorithm. As a(k) 
is increased, a higher weight is placed on the difference 
between the previous controller coefficients and the updated 
controller coefficients, and, as a result, convergence speed is 
lowered. Likewise, as a(k) is decreased, convergence speed is 
raised. By varying a(k), we can effect tradeoffs between 
transient performance and convergence speed. 

We define the retrospective performance variable zeM 4 by 


j{9, /c) = ( vec9 ) M(k)vec9 + b T (k)vec9 + c(k ) 
where 


Z(k) = [h 0, zX/z ... 0 l xl ]Z(9(k), k). 


M{k) = D t (k)Ri (k)D(k) + 

2 D T (k)[f(k) <g> R l2 (k)] + [<t>(k)<f> T m <8> Ri{k) + R 3 (k) 0 R 4 (k) 


In the particular case z=y, using z in place of y in the regressor 
vector (13) yields faster convergence. Therefore, for z=y, we 
redefine (13) as 



11 


US 8,260,440 B2 


12 


m = 


z(k - 1 ) ' 

(37) 

G r ,Az) = y z-‘h, 

i = i 

z(k-n c ) 

5 

1 

u(k - 1 ) 

= 

z r 



1 



= - r Pr{z) 

u(k -n c ) 


z r 


( 46 ) 


The novel feature of the adaptive control algorithm given 
by (1 1 ) and (29) is the use of the retrospective correction filter 
(RCF) (19), as shown in FIG. 1 for p=l. RCF provides an 
inner loop to the adaptive control law by modifying the 
extended performance vector Z(k) in terms of the difference 
between the actual past control inputs U(k) and the recom- 
puted control inputs U(0,k). 

Markov-Parameter Polynomial 

By recursively substituting (1) into (3), it follows that z(k) 
can be represented by 

zi^E^xik-r^H^ik - 1 )+H 2 u(k-2)+ . . . +H r u(k- 

1)+ ■ ■ ■ +H ZWtr w(k-r), (38) 

where ri^d, H zw 0 4 E 0 , and, for all i>0, H zw ^aE 1 A i_ 1 D 1 . In 
terms of the backward-shift operator q _1 , (38) can be rewrit- 
ten as 

z(k)=E l A r q~ r x(k)+[H l q~ i +H 2 q~ 2 + . . . +H r q~ r ]u(k)+ 

\H zw , 0 +H ZW7l q~ l + . . . +H zwr q~ r ]w(k) 

Shifting (39) forward by r steps gives 

z(k+r)=E 1 A r x(k)+p r (q)u(k)+W r (q)w(k), 
where q is the forward-shift operator, 

W r {q)UI^, 0 cf+H^ l q'- l +H zm ^ 2 + . . . +H zw _ n 

and 

p r (q) kH i q r ^ i +H 2 <f~ 2 + . . . +H r . 

We call p 7 ,(q) the Markov-parameter polynomial. Note that 
p r (q) is a matrix polynomial in the MIMO case and a poly- 4Q 
nomial in the SISO case. Furthermore, since H 1 = . . . == H rf _ 1 =0 
when di= 2, it follows that, for all ri^di= 1 , p r (q) can be written 
as 

Pr(q)=H d q r - d +H d+l q r - d - l + . . . +H r . (43) 

The Markov-parameter polynomial p r (q) contains infor- 45 
mation about the relative degree d and, in the SISO case, the 
sign of the high-frequency gain, that is, the sign of H rf . We 
show below that p r (q) also contains information about the 
transmission zeros of G zu (z) aE : (zI-A) _ 1 B, which is given by 


(39) 


(40) 


30 


(41) 


35 


(42) 


G»fe) = 


Z n + aiz n ~ l + ... + a n 




i-/y 


(44) 


Consequently, the Markov-parameter polynomial p 7 .(q) is 
closely related to the truncated Laurent expansion of G zu . 
Approximation of Outer Nonminimum-Phase Zeros 

In the case of MIMO systems, p r (q) is a matrix polynomial 
and thus does not have roots in the sense of a polynomial. We 
therefore require the notion of a Smith zero. Specifically, z€(C 
is a Smith zero of p r (q) if the rank of p r (z) is less than the 
normal rank of p r (q), that is, the maximum rank of p r (£) taken 
over all ^eC- 

Definition 5.1 — -Let ^€(C be a transmission zero of G zu . 
Then, ^ is an outer zero of G zu if 1^1 =p(A). Otherwise, \ is an 
inner zero of G zu . 

The following result shows that the Smith zeros of the 
Markov-parameter polynomial p r (q) asymptotically approxi- 
mate each outer transmission zero of G zu . 

Fact 5.1 — Let IjeCbe an outer transmission zero of G zu . For 
eachr, let 9J 7 .a{^ 1? . . . , ^ } denote the set of Smith zeros 

of p r (q). Then, there exists a sequence . j 00 ^ that con- 
verges to ^ as r^oo. 

The following specialization to SISO transfer functions 
shows that the roots of p r (q) asymptotically approximate each 
outer zero of G zu . 

Fact 5.2 Consider 1 M =1 Z =1, and let §e(Cbe an outer zero of 
G zu . For each r, let %. . . . , ^> r ,r-d\ he the set of roots of 
p r (q). Then, there exists a sequence that converges 

to % as r— >oo. 

The following examples illustrate Fact 5 .2 by showing that, 
as r increases, roots of the Markov-parameter polynomial 
p r (q), and hence, roots of the numerator of the truncated 
transfer function G r ^. u , asymptotically approximate each 
outer nonminimum-phase zero of Y zu . The remaining roots of 
p r (q) are either located at the origin or form an approximate 
ring with radius close to p(A). These roots are spurious and 
have no effect on the adaptive control algorithm. 

Example 5.1 (SISO, nonminimum-phase, stable plant). 
Consider the plant G zu with d=2, H 2 =l, poles 0.5 ±0.5 j, 
-0.5±0.5 j, ±0.95, ±0.7 j, minimum-phase zeros 0.3±0.7 j, 
-0.7±0.3 j, and outer nonminimum-phase zeros 1.25, -1.5. 
Table 1 lists the approximated nonminimum-phase zeros 
obtained as roots of p r (fi) as a function of r. Note that as r 
increases, the outer nonminimum-phase zeros are more 
closely approximated by the roots of p r (fi)- See FIG. 2. 

TABLE 1 


In order to relate the transmission zeros of G zu to p r (q), the 
Laurent series expansion of G zu about z=oo is given by 


1 = 1 


(45) 


This expansion converges uniformly on all compact sub- 
sets of {z: lzl>p(A)}, where p(A) is the spectral radius of A. 
By truncating the summation in (45), we obtain the truncated 
Laurent expansion G r ^ u of G zw given by 


Approximated nonminimum-phase zeros obtained as roots 
of p r (q) as a function of r for the stable, nonminimum-phase 
plant in Example 5.1. As r increases, the outer zeros 
are more accurately modeled. 

r 

roots„^ ( Pr (q)) 

6 

(0.944, -1.537} 

8 

{1.170,-1.502} 

10 

{1.207, -1.498} 

15 

{1.240, -1.499} 

20 

{1.248,-1.500} 

25 

{1.250,-1.500} 
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Example 5.2 (SISO, nonminimum-phase, unstable plant). 
Consider the plant G zu with d=2, H 2 =l, poles 0.5±0.5 j, 
-0.5±0.5 j, ±0.7 j, -0.95, 1 .4 minimum-phase zeros 0.3±0.7j, 
-0.7±0.3 j, outer nonminimum-phase zero -1.5, and inner 
nonminimum-phase zero 1.25. FIG. 3 shows the roots of 5 
p 25 (q). Note that the root of p 25 (q) outside p(A) is close to the 
outer nonminimum-phase zero -1.5. However, the inner non- 
minimum-phase zero 1.25 is not approximated by a root of 
p 25 (q). The remaining roots are either located at the origin or 
form an approximate ring with radius close to p(A). 10 

Approximation of Inner Nonminimum-Phase Zeros 

Example 5 .2 illustrates that the roots of p r (q) approximate 
each outer nonminimum-phase zero of G zu . However, inner 
nonminimum-phase zeros of G zu are not approximated by 
roots of p r (q). To overcome this deficiency, we can use infor- 15 
mation about the plant’s unstable poles to create a modified 
Markov-parameter polynomial p r (q) whose roots approxi- 
mate each nonminimum-phase zero of G zu . For illustration, 
assume that the SISO plant G zu has a unique unstable pole 
Cwho se ab solute value is greater than all other poles of G zu . 20 
Then, we define 


GM = —GM 
z 


= GM--GM 
z 

i=d i=d 

i=d 


(47) 

25 


30 


remaining roots are either located at the origin or form an 
approximate ring with radius close to p(A). 

Construction of B zu 

We present four constructions for B zu based on the avail- 
able modeling information. 

Ek^-Based Construction 

If B zu given by (8) is known, then, with p c =q c , B zu can be 
chosen to be equal to B zu . In this case, U(k)=U 1 (k), and (17) 
becomes 


(49) 

This construction of B zu captures information about the 
relative degree d, the first nonzero Markov parameter (since 
Hrf = P<f), and exact values of all transmission zeros of G zW that 
is, both minimum-phase and nonminimum-phase transmis- 
sion zeros. 

Nonminimum-Phase-Zero-Rased Construction 

Consider 1 M =1 Z =1 and assume that H d and the nonmini- 
mum-phase zeros of G zu are known. Then we define the 
nonminimum-phase-zero polynomial N(q) to be the polyno- 
mial whose roots are equal to the nonminimum-phase zeros 
of G zu , that is, 




(50) 


where mi^O is the number of nonminimum-phase zeros in 
G zu , and (3 y , . . . , M. If m=0, that is, G zu is minimum phase, 

then N(q)=H rf . With p c =q c , the nonminimum-phase-zero- 
based construction of B zu is thus given by 


H x ... H d 


An ■■■ ® l z xl u ® l z xl u ■■■ ® l z xl u 


(51) 


0/ z x/ H ... 0/ 2 x/„ H\ ... H d A ... p m o /zX , H ... o /zX/m 


-continued 

i=d 

where, for i=l , 2, . . . , H z aH^H^ are the modified Markov 
parameters, and H o =0. By repeating this operation for each 50 
unstable pole of G zu , , the roots of the modified Markov-pa- 
rameter polynomial 


where H x = . . . = H rf _ 1 =0. This construction of B zu captures 
information about the relative degree d, the first nonzero 
Markov parameter, and exact values of all nonminimum- 
phase zeros of G zu . In the minimum-phase case, the only 
required modeling information is This construction of B zu 

can be extended to the MIMO case by replacing each mini- 
mum-phase zero in the Smith-McMillan form of G zu by a zero 
at z=0. 

r-MARKOV-Based Construction 


pm) = fid<r d + Hd + i <r d ~ l + ■ ■ ■ + hr (48) 

can approximate each nonminimum-phase zero of G zu . The 
following example illustrates this process. 60 

Example 5.3 (Ex. 5.2 with pole information) Reconsider 
Example 5.2, where the inner nonminimum-phase zero 1.25 
is not approximated by a root of p r (q). Using knowledge of 
the unstable pole 1 .4 to construct p r (q) given by (48), FIG. 4 
shows the roots of p 25 (q). Note that the roots outside p(A), 65 
where A is the dynamics matrix of a minimal realization of 
G zu , are close to the nonminimum-phase zeros of G zu . The 


Replacing k with k-1 in (4) and substituting the resulting 
relation back into (4) yields a 2-MARKOV model. Repeating 
this procedure r-1 times yields the r- MARKOV model of 

(1X3) 


n 

Z (k) = ^ a r ,,z{k - r - i + 1 ) + 
/= l 


( 52 ) 


r-l n 

HiU(k - i) + ^ P r ,iU{k -r-i + 


i=d 


/=! 


1 ) + 



15 

-continued 
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r - 1 n 

2 H zwM k “0 + 2 yrM k -r-i+ 1), 

/=0 /=1 

where, for i=l, . . . , n, the coefficients a ri € M. p r>i e and 
y rjZ € M /zX/ “' are given by 

A o A o A 

a u = -oc h Pi,i=Ph 7i,i=7«. 


a r,< = +^- 1 , 1+0 Pr,i = + /Ul,i+U 7r,i = a r-U7l,i 


16 

advantage in using B zu given by (56) rather than (55) is that 
p r2 , . . . , p need not be known. If, however, G zu has inner 
nonminimum-phase zeros and the unstable poles of G zu 
whose absolute values are greater than at least one inner 
nonminimum-phase zero are known, then we can replace the 
Markov parameters H 1; . . . , H r in (56) by the modified 
Markov parameters E^ , . . . , H r given in (47). If these poles are 

(53) 


7r-l,i+ 1, 


A 

a, -si = ttr-1,1 &L/ 1 , 


Pr,n = 


7r,n = «M,l7M- 


Note that |3 rl =H r and Yr,i = H zw , r . We represent (52) with w=0 20 
as the r- MARKOV transfer function 

1 (54) 

( ' r ’ zu{Z) - j«»-l+Qr r _ lZ »-l+...+Of r| „ 25 

+ ... + H r . t f + H r f- 1 + ft, 2 z»- 2 + ...+/}„) 


The system representation (54) is nonminimal since its 
order is n+r-1, and thus (54) includes poles that are not 
present in the original model. Furthermore, note that the 
coefficients of the terms z w+r_2 through z" in the denominator 
are zero. These facts are irrelevant for the following develop- 
ment. Using the numerator coefficients of (54), the 35 
r-MARKOV-based construction of B zu with p c =q c +r-l is 
given by 


Hi ■ 

■■ H r p r ,2 . 

■ ■ Pr,n 0 i zX i u . 

■ ■ 0/ x/ 

« z xt M 

(55) 4Q 

0 l z xl u 



■■ 0/ z x/ H 


0/ 2 X/ m 

■■ 0, zX , H Hi . 

.. H r p r ,2 ■ 

■■ fir* _ 

45 


This construction of B zu captures information about the 
relative degree d, the first nonzero Markov parameter, and 
exact values of all transmission zeros of G zu , that is, both 
minimum-phase and nonminimum-phase transmission zeros. 

Markov-Parameter-Based Construction 


Using the numerator coefficients of (46), the Markov-pa- 
rameter-based construction of B zu with p c =q c +r- 1 is given by 


55 


(56) 


B Z u — 




60 


The Markov parameters are the numerator coefficients of a 
truncated Taurent series expansion of G zu about z=oo. The 
Markov parameters contain information about the relative 65 
degree d and, as shown by Fact 5.2 for the SISO case, approxi- 
mate values of all outer nonminimum-phase zeros of G zu . The 


not known, then B zu can be chosen to be either B ZLO the 
nonminimum-phase-zero form (51), or the r-MARKOV form 
(55). 

Note that, if the order n of the system is known and 2n+l 
Markov parameters are available, then a state-space model of 
the system can be reconstructed by using the eigensystem 
realization algorithm. However, the examples considered in 
sections below use substantially fewer Markov parameters. 

NUMERICAL EXAMPLES 
Nominal Cases 

We now present numerical examples to illustrate the 
response of the RCF adaptive control algorithm under nomi- 
nal conditions. We consider a sequence of examples of 
increasing complexity, ranging from SISO, minimum-phase 
plants to MIMO, nonminimum-phase plants, including stable 
and unstable cases. Each SISO example is constructed such 
that H^l . All examples assume y=z with (|)(k) given by (37), 
and, in all simulations, the adaptive controller gain matrix 
0(k) is initialized to zero. Unless otherwise noted, all 
examples assume x(0)=0. 

Example 7.1 (SISO, minimum-phase, unstable plant, sta- 
bilization) Consider the plant G zu with d=l , poles 0, 1.5, and 
inner nonminimum-phase zero -1.25. For stabilization, we 
take D : and E 0 to be zero matrices. Let B zu be given by (51), 
which is constructed using the first nonzero Markov param- 
eter H 1 =l and the location of the nonminimum-phase zero 
-1 .25, that is, N(q)=q+1.25. Wetaken c =2, p=l, and a(k)=10. 
The closed-loop response is shown in FIG. 5 for x(0)=[0.1 
0.4] r . 

Example 7.2 (SISO, minimum-phase, unstable plant, com- 
mand following). Consider the double integrator plant G zu 
with d=3, poles 0.5 ±0.5 j, -0.5±0.5 j, 1, 1, and a minimum- 
phase zeros 0.3 ±0.7 j, 0.5. We consider a command-following 
problem with step command w(k)=l . With the plant realized 
in controllable canonical form, we take D x =0 and E 0 =- 1 . We 
taken c =10, p=5, a(k)s5, and r=10 with B zw given by (56). The 
closed-loop response is shown in FIG. 6. 

Example 7.3 (SISO, minimum-phase, stable plant, com- 
mand following and disturbance rejection). Consider the 
plant G zu with d=3, poles 0.5±0.5 j, -0.5±0.5 j, ±0.9, ±0.7 j, 
and minimum-phase zeros 0.3±0.7 j, 0.7±0.3 j, 0.5. We con- 
sider a combined step-command-following and disturbance- 
rejection problem with command Wj_ and disturbance w 2 
given by 
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wi(k) 


' 5 

(57) 


' s 2 - 0.5s z 4 - 0.1s 3 - 0.22s 2 + 0.59s - 0.29 ' 

w{k) = 

w 2 (k) 

= 

sinfti k 

' 

G ZU (Z) = 

Di(z) Di(z) 

s-0.5 s 3 -l.U 2 + 0.88s -0.29 


J L Ddz) D { (Z) 

where Q^Jt/10 rad/sample. With the plant realized in con- 
trollable canonical form, we take where 


ro oi 

Di = and£b = [-l 0 ]. 


The disturbance, which is not matched, is assumed to be 15 
unknown, and the command signal is not used directly. We 
take n c =20, p=l , a(k)=50, andr=3 with B zu given by (56). The 
closed-loop response is shown in FIG. 7. 

The following examples are disturbance-rejection Simula- ^ 
tions, that is, E o =0, with the unknown two-tone sinusoidal 
disturbance 


D V {Z) = s 5 + 0 .Is 4 + 0.09s 3 - 0.401s 2 - 0.196s - 0.2205, d= 1, 


Consequently, G zu has poles -0.5±0.5 j, 0.9, ±0.7 j, -0.5±0.5 
j, 0.9, ±0.7 j and minimum-phase transmission zeros 0.3 ±0.7 
j, 0.5, 0.5. We take n c =15, p=l, a(k)sl, and r=10 with B zu 
given by (56). The closed-loop response is shown in FIG. 14. 

Example 7.8 (MIMO, nonminimum-phase, stable plant, 
disturbance rejection) Consider the two-input, two-output 
plant 


I" sinfliA: 1 (58) 

w(k) = 25 

[ -1.5sinfl 2 /c J 

where Q^Jt/10 rad/sample and Q 2 =l 3 jt/50 rad/sample. With 
each plant realized in controllable canonical form, we take 


G»fc) = 


s 2 - 0.5s z 2 -z - 2 
D { (z) D 2 (z ) 

s-0.5 s-2 

Dy{z) D 2 {z) 


D i = 



where D^z) is in given n Example 7.7, D 2 (z)z 3 -0.2z 2 + 
0.34z+0.232, d=l, and 


and, therefore, the disturbance is not matched. 

Example 7.4 (SISO, minimum-phase, stable plant, distur- 
bance rejection) Consider the plant G zu with d=3, poles 
0.5±0.5 j, -0.5±0.5 j, ±0.9, ±0.7 j, and minimum-phase zeros 
0.3±0.7 j, -0.7±0.3 j, 0.5. Taking n c =15, p=l, a(k)s25, and 
r=3 with B zu given by (56), the closed-loop response is shown 
in FIG. 8. The control algorithm converges (see FIG. 9) to an 
internal model controller with high gain at the disturbance 
frequencies, as seen in FIG. 10. 

Example 7.5 (SISO, nonminimum-phase, stable plant, dis- 
turbance rejection) Consider the plant G zu with d=3, poles 
0.5±0.5 j, — 0. 5±(). 5 j, ±0.9, ±0.7 j, minimum-phase zeros 
0.3 ±0.7 j, -0.7 ±0.3 j, and outer nonminimum-phase zero 2. 
We take n c =15, p=l, r=7, and a(k)s25. The Markov -param- 
eter polynomial used to construct B zu as in (56) is given by 
p 7 (q)=q 4 -1.2q 3 -0.96q 2 -0.56q-0.75, with roots 0.01±0.71 j, 
-0.77, 1 .94. Note that the root 1 .94 approximates the zero 2. 
The closed-loop response is shown in FIG. 11. 

To illustrate the effect of the learning rate a(k), the closed- 
loop response is shown in FIG. 12 for a(k)s2500 and all other 
parameters unchanged. Note that, with a(k)s2500, the initial 
transient is reduced at the expense of convergence speed. 

Example 7.6 (SISO, minimum-phase, unstable plant, dis- 
turbance rejection). Consider the plant G zu with d=3, poles 
0.5±0.5 j, -0.5±0.5 j, ±1 .04, 0. 1 ± 1 .025 j, and minimum-phase 
zeros 0.3±0.7 j, -0.7±0.3 j, 0.5 . We take n c =l 5, p=l , a(k)s25, 
and r=10 with B zu given by (56). The closed-loop response is 
shown in FIG. 13. 

Example 7.7 (MIMO, minimum-phase, stable plant, dis- 
turbance rejection). Consider the two-input, two-output plant 


Consequently, G zu has poles -0.5±0.5 j, 0.3±0.7 j, ±0.7 j, 
40 -0.4, 0.9, minimum -phase transmission zero 0.5, and outer 
nonminimum-phase transmission zero 2 . We take n c =l 5 , p=2, 
a(k)sl, and r=8 with B zu given by (56). The closed-loop 
response is shown in FIG. 15. 

Example 7.9 (MIMO, nonminimum-phase, unstable plant, 
45 disturbance rejection) Consider the two -input, two -output 
plant 


50 


G ZU (Z) = 


z 2 - 0.5s z 2 - z - 2 
D 3 (z) D 4 (z ) 

s-0.5 s-2 

D 3 (z) D 4 (z ) 


55 where 


D 3 (s) = s 5 - 1.1s 4 + 1.731s 3 - 1.494s 2 + 0.608s - 0.4679, 
D 4 (z ) = s 3 + 1.4s 2 + 0.9s + 0.2, d = 1, and Hi = [ 1 


Consequently, G zu has poles -0.5±0.5 j, ±0.7 j, 0.1±1.025 j, 
-0.4, 0.9, minimum -phase transmission zero 0.5, and outer 
65 nonminimum-phase transmission zero 2 . We take n c =l 0, p=l , 
a(k)sl, and r=10 with B zu given by (56). The closed-loop 
response is shown in FIG. 16. 
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NUMERICAL EXAMPLES 
Off-Nominal Cases 

We now revisit the numerical examples of the preceding 
section to illustrate the response of the RCF adaptive control 
algorithm under conditions of uncertainty in the relative 
degree and Markov parameters as well as measurement noise 
and actuator and sensor saturation. In each example, the adap- 
tive controller gain matrix 0(k) is initialized to zero. Unless 
otherwise noted, all examples assume x(0)=0. 

Example 8.1 (Ex. 7.5 with Markov-parameter multiplica- 
tive error) Reconsider Example 7.5 with Markov-parameter 
multiplicative error. For controller implementation, we use 
the estimate B at|B, where is varied between 0.3 and 5. 
For i=l , . . . , r, the estimated Markov parameters H Z =CA Z-1 B 
are used to construct B zu given by (56). Taking n c =15, p=l, 
r=10, and a(k)=1000, the closed-loop performance is com- 
pared in FIG. 17. In each case, the control is turned on at k=0, 
and the performance metric is given by 


k 0 = miii/c > 9 : \z(k - 01 < 0.01 


that is, k 0 is the minimum time step k such that the average of 
{ lz(k— i) I } z=0 9 is less than 0.01. FIG. 17 shows that the best 
performance is obtained for r|=l, which corresponds to the 
true value of B. As r\ is decreased, convergence slows signifi- 
cantly. 

In the case where the sign of the first nonzero Markov 
parameter (the sign of the high-frequency gain) is wrong, that 
is, H 3 =-J 3 , the simulation fails. These simulations suggest 
that performance degradation due to an unknown scaling of 
the Markov parameters provides a useful measure of adaptive 
gain margin. These findings are consistent with the adaptive 
gain-margin results. 

Example 8.2 (Ex. 7.5 with unknown latency). A known 
latency of 1 steps can be accounted for by replacing d by d+1 
in the construction of B zu . However, we now assess the effect 
of unknown latency in Example 7.5, which is equivalent to 
uncertainty in the relative degree d. The system has relative 
degree d=3. For controller implementation, we use the erro- 
neous estimate d of d and take n c =15, p=l, a(k)=1000, and 
r=10 with B zu given by (56). Letting d be either 2, 3, 4, 5, or 
6, Table 2 compares both the performance metric (59) and the 
maximum value of I z(k) I for each estimate d of d. In each 
case, the control is turned on at k=0. The best performance is 
obtained for d=d=3 . 

TABLE 2 


Closed-loop performance comparison of the stable, nonminimum-phase, 
SISO plant in Example 1.5 with unknown latency. 


d 

k 0 

max 1 z(k) 1 

2 

1870 

12.3 

3 

531 

9.4 

4 

847 

8.5 

5 

4633 

10.9 

6 

11660 

3.2 x 10 9 


For controller implementation, we use the erroneous estimate d of d and take n c = 1 5, p = 1 , 
a(lc) ■ 1000, and r = 1 0 with F zu given by (56). The best performance is obtained for d = d 
= 3 
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These simulations show the sensitivity of the adaptive con- 
troller to unknown errors in the relative degree d, which 
provides a useful measure of adaptive phase margin. 

Example 8.3 (Sensitivity to nonminimum-phase-zero 
5 uncertainty). Consider the plant G zu with d=l , H^l , poles 0, 
0.5, and outer nomninimum-phase zero 2. The plant is subject 
to disturbance w(k) given by (58), and thus, with the plant 
realized in controllable canonical form, we take and 

E o =0. Furthermore, we assume y=z and let (|)(k) be given by 
to (37). To illustrate the sensitivity of the adaptive control algo- 
rithm to knowledge of the nomninimum-phase zero, we let 
B zu be given by (51), which is constructed using the first 
nonzero Markov parameter H 1 =l, the nonminimum -phase 
zero 2, and a multiplicative error r\€ M, that is, N(q)=q-2r| . We 
15 vary r\ between 0.75 and 2.5 with n c =l 0, p=l , and a(k)=25. A 
closed-loop performance comparison is shown in FIG. 18 . In 
each case, the control is turned on at k=0, and the performance 
metric is given by (59). The best performance is obtained for 
r|=l .05, which is close to the true value of the nonminimum- 
20 phase zero. Note that the adaptive control algorithm is more 
robust to larger values of r\ than smaller values. 

Example 8.4 (Ex. 7.6 with stabilization and noisy measure- 
ments). Reconsider Example 7.6 with no commands or dis- 
turbances. For stabilization, we take D x and E 0 to be zero 
25 matrices . To assess the performance of the adaptive algorithm 
with added sensor noise, we modify (2) and (3) by 

y(k)=z(k) =E l x(k)+E 0 w(k)+v(k) , (60) 

where v(k)€ M' is Gaussian white noise with mean v=2 and 
standard deviation cr=0. 1 . We take n c = 1 5 , p= 1 , a(k)=2 5 , and 
r=3 with given by (56). For the initial condition 
x(0)=[=0.43 -1.67 0.13 0.29 -1.15 1.19 1.19 -0.04] r , 

the closed-loop response is shown in FIG. 19 . 

Example 8.5 (Ex. 7.4 with actuator and sensor saturation). 
35 Reconsider Example 7.4 with the additional assumption that 
both the control input and sensor measurement are subject to 
saturation at ±2. We take n c =15, p=l, a(k)=25, and r=3 with 
given by (56). The closed-loop response shown in FIG. 20 
indicates that the saturations degrade steady-state perfor- 
40 mance. 

Example 8.6 (Ex. 7.4 with command following and actua- 
tor saturation). Reconsider Example 7.4 with step command 
given by w(k)=l. With the plant realized in controllable 
canonical form, we take D x =0 and E 0 =-l . Taking n c =l 5, p=l , 
45 a(k)=25, and r=3 with B zu given by (56), the closed-loop 
responses are shown in FIG. 21 with and without actuator 
saturation at ±0.1 . With actuator saturation, the performance 
variable reflects the capability of the saturated control. 
Model Reference Adaptive Control 
50 Model reference adaptive control (MRAC) , as illustrated in 

FIG. 22 , is a special case of (l)-(3), where 


is the difference between the measured output y l of the plant 
G and the output y m of a reference model G w . For MRAC, the 
exogenous command w is assumed to be available to the 
60 controller as an additional measurement variable y 2 . Unlike 
standard MRAC methods, retrospective cost adaptive control 
does not depend on knowledge of the reference model G m . 

We now present numerical examples to illustrate the 
response of the RCF adaptive control algorithm for model 
65 reference adaptive control (see FIG. 22). Unless otherwise 
noted, the adaptive controller gain matrix 0(k) is initialized to 
zero. 
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Boeing 747 Longitudinal Dynamics 

Consider the longitudinal dynamics of a Boeing 747 air- 
craft, linearized about steady flight at 40,000 ft and 774 ft/sec. 
The inputs to the dynamical system are taken to be elevator 
deflection and thrust, while the output is the pitch angle. The 5 
continuous-time equations of motion are thus given by 


" u 


■ -0.003 

0.039 

0 

-0.322' 

' u ' 

w 


-0.065 

-0.319 

7.74 

0 

w 

q 


0.020 

-0.101 

-0.429 

0 

q 

_e_ 


0 

0 

1 

0 

9 _ 


(61) 

10 


0.010 1 
-0.180 -0.040 T(5 e l 
-1.160 0.598 [6t\ 

0 0 


y = 


u 


0 0 0 1 
0 0 0 0 



9 


(62) 2Q 


z = yi-y m . 


(63) 


where w is the exogenous command and y m is the output of 
the reference model 


y OT (s) 0.0131 

W(s) = s 2 +0.165 + 0.0131' 


30 

(64) 


We discretize (61)-(64) using a zero-order hold and sam- 35 
pling time T^=0.0 1 sec. The reference command is taken to be 
a 1 -deg step command in pitch angle. The controller order is 
n c = 10 with parameters p=l, a(k)=40, and r= 10 with B zu given 
by (56). The closed-loop response is shown in FIG. 23 for 
zero initial conditions. 40 

Missile Longitudinal Dynamics 

We now present numerical examples for MRAC of missile 
longitudinal dynamics under off-nominal or damage situa- 
tions. The missile longitudinal plant is derived from the short 
period approximation of the longitudinal equations of 45 
motion, given by 


r -1.064 1 1 [ -0.25 1 (65) 

l 290.26 0 f + A l -331.4 J“’ 

[- 12 0 334 ^ + a [- 13 - 51 ]«, (66) 


where 



reference model G m consists of the basic missile longitudinal 
plant with X=1 and the classical three-loop autopilot. An 
actuator amplitude saturation of ±30 deg=±0.524 rad is 
included in the model, but no actuator or sensor dynamics are 
included. 

The goal is to have the missile follow a pitch acceleration 
command w consisting of a 1 -g amplitude 1 -Hz square wave. 
The performance variable z is the difference between the 
measured pitch acceleration A z and the reference model pitch 
acceleration A z *, that is, 


Z = A z -A*. 

The closed-loop response is shown in FIG. 24 for X =\ . Since 
the plant and reference model are identical in the nominal 
case, the adaptive control input u^ c =0. 

All of the following examples use zero initial conditions 
and the same adaptive controller parameters. The adaptive 
controller is implemented at a sampling rate of 300 Hz. We 
take n c =3, p=l, and r=20 with Q zu given by (56). A time 
varying learning rate a(k)=75k+l is used such that, initially, 
controller adaptation is fast, and, as performance improves, 
the adaptation slows. The learning rate is identical for each 
simulation. System identification using the observer/Kalman 
filter identification (OKID) algorithm is used to obtain the 20 
Markov parameters required for controller implementation. 
The offline identification procedure is performed with a 
nominal simulation (X=l) by injecting band-limited white 
noise at the adaptive controller input u ac and recording the 
performance variable z while the autopilot is in the loop. No 
external disturbances are assumed to be present during the 
identification procedure. 

Example 9.1 (50% control effectiveness). Consider 
X=0.50. FIG. 25 shows simulation results with the adaptive 
controller turned off, that is, autopilot-only control. Now, 
with the autopilot augmented by the adaptive controller, 
simulation results are shown in FIG. 26. After a transient, the 
augmented controllers provide better performance than the 
autopilot-only simulation. 

Example 9.2 (25% control effectiveness). Consider 
X=0.25. With the adaptive controller turned off, that is, auto- 
pilot-only control, the simulation fails. With the autopilot 
augmented by the adaptive controller, simulation results are 
shown in FIG. 27. After a transient, the augmented controllers 
stabilize the system, whereas the autopilot-only simulation 
fails. 

FIG. 27 shows that the total control input u reaches the 
actuator saturation level of ±30 deg. To reduce the initial 
transient, we initialize the adaptive controller with the con- 
verged control gains 0 from the 50% control effectiveness 
case. As shown in FIG. 28, the initial transient is reduced as 
compared with initializing the control gains to zero. In this 
case, the actuator saturation level is not reached. 

CONCLUSION 


and Le(0,l] represents the control effectiveness. Nominally, 
X=l. 

The open-loop system (65), (66) is statically unstable. To 
overcome this instability, a classical three-loop autopilot is 
wrapped around the basic missile longitudinal plant. The 
adaptive controller then augments the closed-loop system to 
provide control in off-nominal cases, that is, when L<1 . The 
autopilot and adaptive controller inputs are denoted u ap and 
u ac , respectively. Thus, the total control input u=u ap +u ac . The 


We presented the RCF adaptive control algorithm, system, 
and method and demonstrated its effectiveness in handling 
nonminimum-phase zeros through numerical examples illus- 
trating the response of the algorithm under conditions of 
uncertainty in the relative degree and Markov parameters, 
measurement noise, and actuator and sensor saturations. 
Bursting was not observed in any of the simulations. We also 
suggested metrics that can serve as gain and phase margins 
for discrete-time adaptive systems. Development of 


60 


65 
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Lyapunov-based stability and robustness analysis of the RCF 
adaptive control algorithm as well as development of a theo- 
retical foundation for analyzing broadband disturbance-re- 
jection properties of the controller is anticipated. 

The foregoing description of the embodiments has been 
provided for purposes of illustration and description. It is not 
intended to be exhaustive or to limit the invention. Individual 
elements or features of a particular embodiment are generally 
not limited to that particular embodiment, but, where appli- 
cable, are interchangeable and can be used in a selected 
embodiment, even if not specifically shown or described. The 
same may also be varied in many ways. Such variations are 
not to be regarded as a departure from the invention, and all 
such modifications are intended to be included within the 
scope of the invention. 

What is claimed is: 

1. A method for adaptive control of nonminimum phase 
systems, said method comprising: 

receiving input data; 

distinguishing between said input data and modified data; 

determining control gains from said input data and esti- 
mates of nonminimum phase zeros using a modified 
quadratic cost criterion that uses a retrospective perfor- 
mance vector; 

calculating a control signal based on said control gains, 
said input data, and said retrospective performance vec- 
tor; and 

outputting said control signal. 

2. The method according to claim 1 wherein said determin- 
ing control gains comprises determining control gains using 
estimates of a relative degree, a first nonzero Markov param- 
eter, and nonminimum phase zeros. 

3. The method according to claim 1 wherein said determin- 
ing control gains comprises determining control gains using 
an explicit minimizer of said modified quadratic cost criterion 
to achieve a minimizer in a single operational step. 

4. The method according to claim 1 wherein said determin- 
ing control gains comprises determining control gains using a 
one-step learning penalty. 

5. The method according to claim 1 wherein said determin- 
ing control gains from said input data and estimates of non- 
minimum phase zeros using a modified quadratic cost crite- 
rion comprises a disturbance rejection response. 

6. The method according to claim 1 wherein said determin- 
ing control gains from said input data and estimates of non- 
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minimum phase zeros using a modified quadratic cost crite- 
rion comprises a command following response. 

7. The method according to claim 1 wherein said determin- 
ing control gains from said input data and estimates of non- 

5 minimum phase zeros using a modified quadratic cost crite- 
rion comprises a stabilization response. 

8. The method according to claim 1, wherein said deter- 
mining control gains further comprises adjusting a rate of 
convergence. 

to 9. A system for adaptive control of nonminimum phase 
systems, said system comprising: 

a device receiving input data; 

a device distinguishing between said input data and modi- 
fied data; 

1 5 a controller determining control gains from said input data 
and estimates of nonminimum phase zeros using a modi- 
fied quadratic cost criterion, said controller calculating a 
control signal based on said control gains and said input 
data and outputting said control signal. 

20 10. The system according to claim 9 wherein said control- 

ler determines control gains using a relative degree, a first 
nonzero Markov parameter, and nonminimum phase zeros. 

11. The system according to claim 9 wherein said control- 
ler determines control gains using estimated Markov param- 

25 eters to obtain estimates of a relative degree, a first nonzero 
Markov parameter, and nonminimum phase zeros. 

12. The system according to claim 9 wherein said control- 
ler determines control gains using an explicit minimizer of 
said modified quadratic cost criterion to achieve a minimum 

30 in a single operational step. 

13. The system according to claim 9 wherein said control- 
ler determines control gains using a one- step learning penalty. 

14. The system according to claim 9 wherein said control- 
ler employs a modified quadratic cost criterion to define a 

35 disturbance rejection response. 

15. The system according to claim 9 wherein said control- 
ler employs a modified quadratic cost criterion to define a 
command following response. 

16. The system according to claim 9 wherein said control - 

40 ler employs a modified quadratic cost criterion to define a 

stabilization response. 

17. The system according to claim 9, wherein said control- 
ler comprises a device for adjusting a rate of convergence. 



